TEI exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Data-processing modeling of dynamic structures of textual segments for the analysis of corpus

Internal identifier: 000019 (France/Analysis); previous: 000018; next: 000020

Authors: François Daoust [France]

RBID : Hal:tel-00870410

Abstract

The objective of the thesis is to propose a data-processing model to represent, build and exploit textual structures. The suggested model relies on a «type/token» form of text representation extended by systems of lexical and contextual annotations. This model was implemented in the SATO software, whose functionalities and internal organization are presented. References to a number of works give an account of the development and use of the software in various contexts. The formal handling of textual and discursive structures finds an ally in the XML markup language and the proposals of the Text Encoding Initiative (TEI). Formally, the structures built on the textual segments correspond to graphs. In a development-driven textual analysis context, these graphs are multiple and partially deployed. Their resolution, through the attachment of the nodes to textual segments or to other graphs, is a dynamic process which can be sustained by various data-processing mechanisms. Examples drawn from textual linguistics are used to illustrate the principles of structural annotation. Prospective considerations for the computer implementation of a management system for structural annotation are also presented.
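
As a rough illustration of the «type/token» representation mentioned in the abstract, the following minimal sketch stores each distinct word form once (a type) and represents the text as a sequence of tokens pointing back to their type. Lexical annotations attach to types and contextual annotations to individual tokens. This is only an assumption-laden sketch, not the SATO implementation: the names `Token`, `build_type_token`, and the annotation keys `pos` and `speaker` are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Token:
    """One occurrence in the text; points to its type in the lexicon."""
    type_id: int
    # Contextual annotations apply to this occurrence only (hypothetical keys).
    context: dict = field(default_factory=dict)


def build_type_token(words):
    """Build a lexicon of types, per-type lexical annotations, and a token list."""
    lexicon = []   # distinct word forms, in order of first appearance
    lexical = []   # one annotation dict per type
    index = {}     # word form -> position in the lexicon
    tokens = []
    for word in words:
        form = word.lower()
        if form not in index:
            index[form] = len(lexicon)
            lexicon.append(form)
            lexical.append({})
        tokens.append(Token(index[form]))
    return lexicon, lexical, tokens


if __name__ == "__main__":
    lexicon, lexical, tokens = build_type_token("the model extends the text".split())
    # A lexical annotation is shared by every occurrence of the type.
    lexical[lexicon.index("the")]["pos"] = "DET"
    # A contextual annotation belongs to a single occurrence.
    tokens[1].context["speaker"] = "A"
    print(lexicon, [t.type_id for t in tokens])
```

Keeping the two annotation levels separate means a property set on a type is immediately visible from all of its occurrences, while properties that depend on position in the text stay local to one token.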

Links to Exploration step

Hal:tel-00870410

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Data-processing modeling of dynamic structures of textual segments for the analysis of corpus</title>
<title xml:lang="fr">Modélisation informatique de structures dynamiques de segments textuels pour l'analyse de corpus</title>
<author>
<name sortKey="Daoust, Francois" sort="Daoust, Francois" uniqKey="Daoust F" first="François" last="Daoust">François Daoust</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-202931" status="VALID">
<idno type="IdRef">168612100</idno>
<idno type="RNSR">201220083G</idno>
<orgName>Edition, Littératures, Langages, Informatique, Arts, Didactique, Discours - UFC</orgName>
<orgName type="acronym">ELLIADD</orgName>
<desc>
<address>
<addrLine>30 rue Mégevand, 25030 Besançon cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-fcomte.fr/pages/fr/menu1/recherche/la-recherche-a-l-ufc/ea-4661---elliadd-18229-17558.html</ref>
</desc>
<listRelation>
<relation name="EA4661" active="#struct-458810" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="EA4661" active="#struct-458810" type="direct">
<org type="institution" xml:id="struct-458810" status="VALID">
<idno type="IdRef">026403188</idno>
<idno type="ISNI">0000 0001 2188 3779 </idno>
<orgName>Université de Franche-Comté</orgName>
<orgName type="acronym">UFC</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-fcomte.fr</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city" wicri:auto="siege">Besançon</settlement>
<region type="region" nuts="2">Franche-Comté</region>
</placeName>
<orgName type="university">Université de Franche-Comté</orgName>
<orgName type="institution" wicri:auto="newGroup">Université de Bourgogne Franche-Comté</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:tel-00870410</idno>
<idno type="halId">tel-00870410</idno>
<idno type="halUri">https://tel.archives-ouvertes.fr/tel-00870410</idno>
<idno type="url">https://tel.archives-ouvertes.fr/tel-00870410</idno>
<date when="2011-01-10">2011-01-10</date>
<idno type="wicri:Area/Hal/Corpus">000005</idno>
<idno type="wicri:Area/Hal/Curation">000005</idno>
<idno type="wicri:Area/Hal/Checkpoint">000023</idno>
<idno type="wicri:explorRef" wicri:stream="Hal" wicri:step="Checkpoint">000023</idno>
<idno type="wicri:Area/Main/Merge">000037</idno>
<idno type="wicri:Area/Main/Curation">000037</idno>
<idno type="wicri:Area/Main/Exploration">000037</idno>
<idno type="wicri:Area/France/Extraction">000019</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Data-processing modeling of dynamic structures of textual segments for the analysis of corpus</title>
<title xml:lang="fr">Modélisation informatique de structures dynamiques de segments textuels pour l'analyse de corpus</title>
<author>
<name sortKey="Daoust, Francois" sort="Daoust, Francois" uniqKey="Daoust F" first="François" last="Daoust">François Daoust</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-202931" status="VALID">
<idno type="IdRef">168612100</idno>
<idno type="RNSR">201220083G</idno>
<orgName>Edition, Littératures, Langages, Informatique, Arts, Didactique, Discours - UFC</orgName>
<orgName type="acronym">ELLIADD</orgName>
<desc>
<address>
<addrLine>30 rue Mégevand, 25030 Besançon cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-fcomte.fr/pages/fr/menu1/recherche/la-recherche-a-l-ufc/ea-4661---elliadd-18229-17558.html</ref>
</desc>
<listRelation>
<relation name="EA4661" active="#struct-458810" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="EA4661" active="#struct-458810" type="direct">
<org type="institution" xml:id="struct-458810" status="VALID">
<idno type="IdRef">026403188</idno>
<idno type="ISNI">0000 0001 2188 3779 </idno>
<orgName>Université de Franche-Comté</orgName>
<orgName type="acronym">UFC</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-fcomte.fr</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city" wicri:auto="siege">Besançon</settlement>
<region type="region" nuts="2">Franche-Comté</region>
</placeName>
<orgName type="university">Université de Franche-Comté</orgName>
<orgName type="institution" wicri:auto="newGroup">Université de Bourgogne Franche-Comté</orgName>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="mix" xml:lang="en">
<term>Computer Aided Text Analysis</term>
<term>Discourse analysis</term>
<term>SATO model</term>
<term>Structural annotation</term>
<term>Textometry</term>
</keywords>
<keywords scheme="mix" xml:lang="fr">
<term>Analyse de discours</term>
<term>Analyse de texte assistée par ordinateur</term>
<term>Annotation structurelle</term>
<term>Modèle SATO</term>
<term>TEI</term>
<term>Textométrie</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">The objective of the thesis is to propose a data-processing model to represent, build and exploit textual structures. The suggested model relies on a «type/token» form of text representation extended by systems of lexical and contextual annotations. This model was implemented in the SATO software, whose functionalities and internal organization are presented. References to a number of works give an account of the development and use of the software in various contexts. The formal handling of textual and discursive structures finds an ally in the XML markup language and the proposals of the Text Encoding Initiative (TEI). Formally, the structures built on the textual segments correspond to graphs. In a development-driven textual analysis context, these graphs are multiple and partially deployed. Their resolution, through the attachment of the nodes to textual segments or to other graphs, is a dynamic process which can be sustained by various data-processing mechanisms. Examples drawn from textual linguistics are used to illustrate the principles of structural annotation. Prospective considerations for the computer implementation of a management system for structural annotation are also presented.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
</country>
<region>
<li>Franche-Comté</li>
</region>
<settlement>
<li>Besançon</li>
</settlement>
<orgName>
<li>Université de Bourgogne Franche-Comté</li>
<li>Université de Franche-Comté</li>
</orgName>
</list>
<tree>
<country name="France">
<region name="Franche-Comté">
<name sortKey="Daoust, Francois" sort="Daoust, Francois" uniqKey="Daoust F" first="François" last="Daoust">François Daoust</name>
</region>
</country>
</tree>
</affiliations>
</record>
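
The record above uses the `wicri:` and `hal:` prefixes without declaring them, so a strict XML parser will likely reject it as shown. The following sketch, assuming Python's standard `xml.etree.ElementTree`, binds those prefixes to placeholder namespace URIs before parsing and then reads the titles and keywords by language. The URIs and the helper names `parse_record` and `summarize` are hypothetical, and `SAMPLE` is a shortened excerpt of the record.

```python
import xml.etree.ElementTree as ET

# Placeholder URIs (assumptions): the page does not say which namespaces
# the wicri: and hal: prefixes stand for, so any distinct URIs work here.
ASSUMED_NS = {
    "wicri": "urn:example:wicri",
    "hal": "urn:example:hal",
}
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def parse_record(xml_text):
    """Parse a <record> string after binding the undeclared prefixes."""
    decls = " ".join(f'xmlns:{p}="{u}"' for p, u in ASSUMED_NS.items())
    # Patch only the first <record> start tag.
    patched = xml_text.replace("<record>", f"<record {decls}>", 1)
    return ET.fromstring(patched)


def summarize(record):
    """Return titles and keyword lists, each keyed by xml:lang."""
    titles = {t.get(XML_LANG): t.text
              for t in record.iterfind(".//titleStmt/title")}
    keywords = {k.get(XML_LANG): [term.text for term in k.iterfind("term")]
                for k in record.iterfind(".//textClass/keywords")}
    return {"titles": titles, "keywords": keywords}


# Shortened excerpt of the record, kept only to exercise the functions.
SAMPLE = """<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Data-processing modeling of dynamic structures of textual segments for the analysis of corpus</title>
<author>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" status="VALID">
<orgName type="acronym">ELLIADD</orgName>
</hal:affiliation>
</affiliation>
</author>
</titleStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="mix" xml:lang="en">
<term>SATO model</term>
<term>Textometry</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
</TEI>
</record>"""

if __name__ == "__main__":
    print(summarize(parse_record(SAMPLE)))
```

Patching the root start tag keeps the rest of the record untouched. Elements and attributes in the `wicri:` or `hal:` namespaces then appear under the placeholder URIs, for example `{urn:example:wicri}level`.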

To work with this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/France/Analysis
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000019 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/France/Analysis/biblio.hfd -nk 000019 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Wicri/Ticri
   |area=    TeiVM2
   |flux=    France
   |étape=   Analysis
   |type=    RBID
   |clé=     Hal:tel-00870410
   |texte=   Data-processing modeling of dynamic structures of textual segments for the analysis of corpus
}}

Wicri

This area was generated with Dilib version V0.6.31.
Data generation: Mon Oct 30 21:59:18 2017. Site generation: Sun Feb 11 23:16:06 2024